Automatic Pattern Embedding in Protein Structure Models
نویسندگان
چکیده
sequence-to-model alignments. Although DSMs are generally represented as hidden Markov models (HMMs), they are designed around a set of rules rather than trained on a large representative set, as most HMMs are. The DSM approach to protein structure modeling has many advantages: flexibility, generality, a high degree of abstraction, and the possibility to build models for hypothetical structures. A few experts—using their understanding of the protein structure families— designed the originally proposed models. However, with the number of deposited protein structures in the protein databank (PDB) surpassing 15,000 and the number of classified protein folds approaching 600, updating DSM libraries through this approach has become difficult. Our method constructs DSMs automatically from determined protein structures. It inherits many merits from the original DSMs, such as dividing structural states into several classes and using expert prior structural knowledge for model design to reduce the model’s free parameters.2 The DSM design’s modularity lets us combine a functional family’s sequence motif with the structural model.3 Ample evidence shows that including sequence information considerably improves fold recognition in other threading approaches.4 Most methods of combining sequence and structural information use some measure of sequence similarity over the entire length of the sequence and the structural model. Combining a minimal (few residues) functional pattern with a structural DSM can provide a sensitive method to identify functional homologs that have similar structures but no detectable overall sequence similarity.2,3 Previously, we had to select and embed those functional patterns in the models manually. In this article, we present a set of rules for automated minimal-pattern selection and embedding.
منابع مشابه
Automatic classification of highly related Malate Dehydrogenase and L-Lactate Dehydrogenase based on 3D-pattern of active sites
Accurate protein function prediction is an important subject in bioinformatics, especially wheresequentially and structurally similar proteins have different functions. Malate dehydrogenaseand L-lactate dehydrogenase are two evolutionary related enzymes, which exist in a widevariety of organisms. These enzymes are sequentially and structurally similar and sharecommon active site residues, spati...
متن کاملMetadata Enrichment for Automatic Data Entry Based on Relational Data Models
The idea of automatic generation of data entry forms based on data relational models is a common and known idea that has been discussed day by day more than before according to the popularity of agile methods in software development accompanying development of programming tools. One of the requirements of the automation methods, whether in commercial products or the relevant research projects, ...
متن کاملFouling mechanisms during protein microfiltration: The effects of protein structure and filtration pressure on polypropylene microporous membrane performance
A polypropylene microporous membrane (PPMM) was fabricated by thermally induced phase separation (TIPS) method. The effects of protein size and structure as well as filtration pressure on the membrane performance and fouling mechanisms were investigated using two different proteins, bovine serum albumin (BSA) and collagen, in dead-end filtration setup. Obtained results showed that, for each pro...
متن کاملModel Based Method for Determining the Minimum Embedding Dimension from Solar Activity Chaotic Time Series
Predicting future behavior of chaotic time series system is a challenging area in the literature of nonlinear systems. The prediction's accuracy of chaotic time series is extremely dependent on the model and the learning algorithm. On the other hand the cyclic solar activity as one of the natural chaotic systems has significant effects on earth, climate, satellites and space missions. Several m...
متن کاملBioinformatics Analysis of Upstream Region and Protein Structure of Fungal Phytase Gene
Phytase increases the bioavailability of phytate phosphorus in seed-based animal feeds and reduces the phosphorus pollution of animal waste. Since most animal feeds for pellets are heated up to 65-80 °C, the production of a thermostable structure for phytase can be useful. In this study, we sought to perform bioinformatics analysis of the upstream region and protein structure of fungal phytase ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Intelligent Systems
دوره 16 شماره
صفحات -
تاریخ انتشار 2001